Content-based Dynamic Email Spam Detecting Using Fuzzy Granular Computing Approach

Author

  • Saber Salehi
Abstract

Spam detection is a significant problem that many researchers have addressed with a variety of strategies. An effective spam detection technique should scan the content of messages to find spam. This research concerns the development of a particular category of granular computing as a classifier for spam detection. A Fuzzy Granular Computing Classification Algorithm (F_GrCCA) is used alongside a clustering algorithm to provide a powerful framework for classifying emails as spam or non-spam. The core structure of this framework is built by a selected data-mining clustering algorithm. In the secondary structure, F_GrCCA is used to reduce the likelihood of classification errors in areas of high overlap between the classes and to cover a substantial number of patterns that do not belong to any cluster. The performance of the proposed technique is measured by several evaluation criteria: spam recall (Rs), spam precision (Ps), accuracy, which accounts for false positives and false negatives, and the Fβ measure. The effectiveness of fuzzy granular computing in spam detection is analysed and compared with a Naive Bayes classifier according to the achieved accuracy. The algorithms were trained and tested on a set of 4601 email messages, of which 1813 were spam and 2788 were non-spam. The experiments are performed with different training set sizes and extracted feature sizes.
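The abstract names its evaluation criteria but does not give their formulas. As a minimal sketch, assuming the standard textbook definitions (which may differ in detail from those used in the paper), the four measures can be computed from a confusion matrix as follows. The class and function names and the counts in the usage example are hypothetical illustrations, not results from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Confusion-matrix counts, treating spam as the positive class."""
    tp: int  # spam correctly flagged as spam
    fn: int  # spam missed (false negative)
    fp: int  # non-spam wrongly flagged as spam (false positive)
    tn: int  # non-spam correctly passed


def spam_recall(c: ConfusionCounts) -> float:
    """Rs: fraction of all spam messages that were caught."""
    total_spam = c.tp + c.fn
    return c.tp / total_spam if total_spam else 0.0


def spam_precision(c: ConfusionCounts) -> float:
    """Ps: fraction of messages flagged as spam that really were spam."""
    flagged = c.tp + c.fp
    return c.tp / flagged if flagged else 0.0


def accuracy(c: ConfusionCounts) -> float:
    """Fraction of all messages classified correctly."""
    total = c.tp + c.fn + c.fp + c.tn
    return (c.tp + c.tn) / total if total else 0.0


def f_beta(c: ConfusionCounts, beta: float = 1.0) -> float:
    """F-beta: weighted harmonic mean of precision and recall.

    beta > 1 weights recall more heavily, beta < 1 weights precision.
    """
    p, r = spam_precision(c), spam_recall(c)
    denom = beta**2 * p + r
    return (1 + beta**2) * p * r / denom if denom else 0.0


if __name__ == "__main__":
    # Made-up counts for illustration only (not from the paper).
    counts = ConfusionCounts(tp=90, fn=10, fp=5, tn=95)
    print(f"Rs={spam_recall(counts):.3f}  Ps={spam_precision(counts):.3f}")
    print(f"accuracy={accuracy(counts):.3f}  F1={f_beta(counts, 1.0):.3f}")
```

In spam filtering a false positive (a legitimate message discarded) is usually costlier than a missed spam, so a value of beta below 1, which weights precision more heavily, is a common choice; the abstract does not say which beta the study used.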

Similar resources

A Novel Hybrid Approach for Email Spam Detection based on Scatter Search Algorithm and K-Nearest Neighbors

Because cyberspace and the Internet predominate in the lives of users, they bring not only business opportunities and time savings but also threats in hardware and software, such as information theft and penetration into systems. Security is the top priority in preventing a cyber-attack, and users should first detect the type of attack because virtual environments are not moni...

A New Model for Email Spam Detection using Hybrid of Magnetic Optimization Algorithm with Harmony Search Algorithm

Unfortunately, among internet services, users are faced with many unwanted messages that are unrelated to their interests and that contain advertising or even malicious content. Spam email comprises a huge collection of infected and malicious advertising emails that cause harm by destroying data and stealing personal information for malicious purposes. In most cases, spam emails con...

A Genetic Based Approach to Optimize The Fuzzy Clustering Spam Filters

Spam email is the practice of frequently sending unwanted email messages, usually with commercial content, in large quantities to an indiscriminate set of email accounts. Effort has been put into solving the spam problem from many directions. We examine the use of an optimization technique to find the best values of the fuzzy clustering parameters, which are the number of clusters and the Fuzzifi...

A Fuzzy Clustering Approach to Filter Spam E-Mail

Spam email is the practice of frequently sending unwanted email messages, usually with commercial content, in large quantities to an indiscriminate set of email accounts. However, since spammers continuously improve their techniques in order to defeat spam filters, building a spam filter that can learn incrementally and adapt has become an active research field. Researches employed m...

A Voice Spam Filter to Clean Subscribers' Mailbox

With the growing popularity of VoIP and its large customer base, the incentives for telemarketers to send voice spam have been increasing in recent years. If the threat of voice spam remains unchecked, it could become a problem as serious as email spam is today. Compared to email spam, voice spam will be a much more obnoxious and time-consuming nuisance for telephone subscribers to filter out. In this...

Publication date: 2013